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Abstract — A cross-bifix-free code is a set of words in which 
no prefix of any length of any word is the suffix of any 
word in the set. Cross-bifix-free codes arise in the study of 
distributed sequences for frame synchronization. We provide a 
new construction of cross-bifix-free codes which generalize the 
construction in Bajic (2007) to longer code lengths and to any 
alphabet size. The codes are shown to be nearly optimal in size. 
We also establish new results on Fibonacci sequences, that are 
used in estimating the size of the cross-bifix-free codes. 

Index Terms — Cross-bifix-free code, Fibonacci sequence, Syn- 
chronization sequence. 



1. Introduction 

A crucial requirement to reliably transmit information in a 
digital communication system is to establish synchronization 
between the transmitter and the receiver. Synchronization is 
required not only to determine the start of a symbol, but also 
to determine the start of a frame of data in the received sig- 
nals. The initial acquisition of frame synchronization and the 
maintenance of this synchronization has been a widely studied 
field of research for several decades. Early works on frame 
synchronization concentrated on introducing a synchronization 
word periodically into the data stream 0, ifTTI . In the receiver, 
correlation techniques were used to determine the position of 
the synchronization sequence within the data stream. Massey 
introduced the notion of bifix-free synchronization word 
in order to achieve fast and reliable synchronization in binary 
data streams. A bifix-free word denotes a sequence of symbols 
in which no prefix of any length of the word is identical to 
any suffix of the word. 

The current methods for achieving frame synchronization 
at the receiver do not look at exact matching of the syn- 
chronization word. Instead the objective is to search for a 
word that is within a specified hamming distance of the 
transmitted synchronization word. This procedure allows for 
faster synchronization between the transmitter and the receiver 
0. Wijngaarden and Willink lfl4l introduced the notion of 
a distributed sequence where the synchronization word is 
not a contiguous sequence of symbols but is instead inter- 
leaved into the data stream. For instance the binary sequence 
110 * * is a distributed sequence where the symbol * 
denotes a data symbol that can take either of the values or 1 . 
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Wijngaarden and Willink lfT4l provided constructions of such 
sequences for binary data streams and studied their properties. 
Bajic et al. [T), showed that the distributed sequence entails 
a simultaneous search for a set of synchronization words. Each 
word in the set of sequences is required to be bifix-free. In 
addition there arises a new requirement that no prefix of any 
length of any word in the set should be a suffix of any other 
word in the set. This property of the set of synchronization 
words was termed as cross-bifix-free in HI, 0, lfl3ll . In the 
same works, the properties of sets of words that are cross- 
bifix-free were statistically analyzed. In this article we term 
the set of words which are cross-bifix-free as a cross-bifix-free 
code. In the above example of a distributed sequence the set of 
words {(1,1, 0,0, 0,0,0), (1,1, 0,0, 0,1,0), (1,1, 0,1, 0,0,0), 
(1, 1, 0, 1, 0, 1, 0)} forms a cross-bifix-free code. 

In a follow up work, Bajic (3| provided a new construction 
of cross-bifix-free codes over a binary alphabet for word 
lengths upto 8. This specific construction uncovers interesting 
connections to the Fibonacci sequence of numbers. In par- 
ticular, the number S(n) of binary words of length n, for 
3 < n < 8, which are cross-bifix-free satisfies the Fibonacci 



recursion 



S{n) = S(n-l) + S{n-2). 



It is noted in that although this construction gives larger 
sets compared to distributed sequences lfl4l for n < 8, the 
sizes of the sets are relatively smaller for lengths greater 
than 8. In a recent work Bilotta et al. 0] introduced a 
new construction of binary cross-bifix-free codes based on 
lattice paths, and showed that their construction attains greater 
cardinality compared to the ones in 0. 

In this work, we revisit the construction in Bajic 0. We 
give a new construction of cross-bifix-free codes that gener- 
alize the construction of in two ways. First, we provide 
new binary codes that are greater in cardinality compared to 
the ones in for larger lengths. In the process we discover 
interesting connections of the size of the codes obtained to 
the so-called fc-generalized Fibonacci numbers. Secondly, we 
generalize the construction to g-ary alphabets for any q, q > 2. 
To the best of our knowledge, this is the first construction of 
cross-bifix-free codes over alphabets of size greater than 2. The 
size of the generalized g-ary constructions are also related to 
a Fibonacci sequence, that we call the (q — l)-weighted fc- 
generalized Fibonacci sequence (see Section [2] for the exact 
definition). Using this relation to the Fibonacci sequences we 
analyze the asymptotic size of our construction. In the process 
of this asymptotic analysis, we generalize a result of Dresden 



2 



||5l on fc-generalized Fibonacci sequence to (q — l)-weighted 
fc-generalized Fibonacci sequence. The main asymptotic result 
on the size of cross-bifix-free codes that we prove is described 
in the theorem below. 

Theorem 1.1. Let C(n,q) denote the maximum size of a 
cross-bifix-free code of length n over an alphabet of size q. 
Then, 

,9-1 
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Note that the lower bound is within a constant factor of 
the best possible construction. The ratio between the lower 
and the upper bound increases towards 1 for larger alphabet 
sizes. In comparison, a similar ratio of the size of the binary 
codes constructed by Bilotta et al. 0), to the quantity 2 n /n, 
asymptotically goes to zero. 

The distributed sequences constructed in iTBl are also cross- 
bifix-free codes. In contrast to the work in Q, J4), they have 
an additional requirement that the number of fixed 0's and l's 
in the sequence are equal or nearly equal. Then the sequence 
is taken to be of the maximum possible length such that the 
above property still holds. The codes we construct do not have 
this restriction. Hence we do not compare the sizes of our 
codes to the sizes of the ones constructed in lfl4l . 

The rest of the paper is organized as follows. Our presen- 
tation is provided for general alphabet size q, q > 2, and the 
results for the binary alphabet are obtained as a special case. 
In Section [3] we provide the construction of the cross-bifix- 
free code and show that for the binary alphabet it is optimal 
for lengths n < 14, barring an exception at n = 9. In Section 
[4] we study the asymptotic behavior of the size of cross-bifix- 
free codes obtained from our construction. In particular, we 
prove Theorem. 11.11 stated above. Results on the behavior of 
the (q — l)-weighted fc-generalized Fibonacci sequence are 
also presented in this section. Lengthy calculations and some 
proofs are deferred to the Appendix. In the section below we 
introduce the basic notations and definitions required. 

2. Notations and definitions 

Let Z g = {0, . . . , q — 1} be an alphabet of q elements. We 
denote by Z* all the nonzero elements of the set Z 9 , that is, 
Z* = Z g \ {0}. A consecutive sequence of m elements b € Z q 
is denoted by the short form b m . As an example, the word 
(0, 0, 1, 1, 1, 0, 1) is represented in short as (0 2 , l 3 , 0, 1). For 
convenience, if m = then b rn is used to denote the absence 
of any element. 

Definition 2.1. For a word u e Z™, a word v is called a prefix 
of u if we can write u as u = (v, w), for some word w. The 
word w is called a suffix of u £ Z™ if we can write u as 
u = (v, w), for some word v. 

For any word u we only consider prefixes and suffixes 
which have length strictly less than the length of u. 

Definition 2.2. A word u e Z™ is called bifix-free if the prefix 
of any length of the word is not a suffix of the word. 



Definition 2.3. A cross-bifix-free code is a set of words in Z™ 
which satisfy the property that the prefix of any length of any 
word is not the suffix of any word in the set, including itself. 

We denote the maximum size of a cross-bifix-free code by 
the notation C(n, q). 

Definition 2.4. The (q—l)-weighted k-generalized Fibonacci 
sequence is a sequence of numbers which satisfies the recur- 
rence relation 

k 

F Kq (n) = ( ? -l)£> fcl9 (n-0, 
i=i 

for some initial values of F k , 9 (0), . . . , F k , q (k — 1). For q = 
2, the sequence obtained is called a k-generalized Fibonacci 
sequence. For q = 2, k = 2, and the initialization ^2.2(0) = 
lj-p2,2(l) = 2, we obtain the usual Fibonacci sequence. 

The (q — l)-weighted fc-generalized Fibonacci sequence is a 
special case of the weighted k-generalized Fibonacci sequence 
which satisfies the recurrence relation Q, lfl2l 

F k (n) = ai F k (n - 1) + a 2 F k (n - 2) + • • • + a k F k (n - fc), 

where the weights are given by ax, 0,2, ■ ■ ■ , a k £ Z, and Z 
denotes the integers. Setting all the weights equal to q — 1 
gives the sequence in the above definition. 

The (q — l)-weighted fc-generalized Fibonacci sequence 
arises in the study of cross-bifix-free codes as described in 
the section below. 

3. A Construction of Cross-Bifix-Free Codes 

In this section we provide a general construction of cross- 
bifix-free codes over the g-ary alphabet. Interestingly, the sizes 
of our construction are related to the (q — 1) -weighted fc- 
generalized weighted Fibonacci numbers F k ^ q {n). The initial- 
ization on F k ^ q (n), < n < k — 1 that we use is given as 



F k>q (n) = q n , n = 0,...,fc-l. 



(1) 



Below, we describe the family of cross-bifix-free codes in 
the space Z™. The family is obtained by varying the value of fc. 

The construction: For any 2 < fc < n— 2, denote by § kyq (n) 
the set of all words (si, S2, ■ ■ ■ , s n ) in Z™ that satisfy the 
following two properties: 

(i) si = S2 = ■ ■ ■ = s k = 0, Sfc+i 7^ and s„ ^ 0, 

(ii) the subsequence (s k + 2 , Sfc+3, . . . , s T i-i) does not contain 
any string of fc consecutive 0's. 

This construction implies that § k . q (n) contains all possible 
words of length n that start with fc zeroes, end with a nonzero 
element, and have at most fc — 1 consecutive zeroes in the last 
n— 1 coordinates. In the remaining part of this section we show 
that for every fc, k = 2, . . . , n — 2, this set of words forms 
a cross-bifix-free code. We determine its size in terms of the 
Fibonacci sequence. First, in the theorem below, we show that 
&k,q( n ) is a cross-bifix-free code. Additionally, we show that 
the code § k , q (n) has the property that it can not be expanded 
while preserving the property that it is cross-bifix-free. That 
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is, for every word x G Z™ \ §k, q (n), the set {x} U §k, q (n) is 
not cross-bifix-free. 

Theorem 3.1. For any k, 2 < k < n — 2, the set §k, q (n) is 
a nonexpandable cross-bifix-free code. 

Proof: To see that §fe g (n) is a cross-bifix-free code, note 
that the prefix of any word of §fe g (n) starts with k consecutive 
zeroes. But in the last n—1 coordinates of any word, we have 
at most k — 1 consecutive zeroes, and the last coordinate is 
always nonzero. Thus, no prefix of any length of any word 
can match any suffix of itself or of any other word in , 9 (n). 

To show that §k.q(n) is nonexpandable we consider all 
the possible configurations of words that could be appended 
to the set Sfe j9 (n). First we note that we can not append 
any word starting with a nonzero element since the nonzero 
element occurs in the last coordinate of some word in §fe i9 (n). 
Similarly, we can not append any word ending with a zero 
element. The other possible configurations of words that we 
need to consider are as follows. 

• Let s be a word which contains at least k consecutive 
zeroes in the last n — 1 coordinates. We consider the 
suffix in s that starts with the last set of k consecutive 
zeroes and contains at most k — 1 consecutive zeroes 
following it, that is, the suffix has the form (0 fc ,a, u), 
where a is nonzero and u is a vector of length m that 
has at most k — 1 consecutive zeroes. Then the word of 
length n (0 fc , a, u, i™-™-^- 1 ) j s a word in §k, q (n) and 
has a prefix matching a suffix of s. Thus s can not be 
appended to :q (n). 

• Let s be a word which contains a prefix of at most 
k — 1 zeroes followed by a nonzero element, that is 
s = (0 Z , a, u), where < I < k — 1, a is nonzero, and u 
has length n— I — 1. It is readily seen that (0 l ,a) is also 
the suffix of the word (0 fc , l™-^'- 1 , ( , a) in § fc ,g(n). 
Hence, such a word can not be appended to §k, q (n). 

Thus, no additional word can be appended to the set §^ g (n) 
while still preserving the cross-bifix-free property. ■ 

The nonexpandability of the construction above parallels 
the nonexpandability of the cross-bifix-free codes obtained in 
Bajic |3|] and Bilotta et al. (4). However, note that the non- 
expandability does not automatically indicate the optimality 
of the construction, as is evident from the many values of k 
for which the nonexpandability holds true. In the following 
sections, we instead show that the largest sized set obtained 
by optimizing over the value of k, k = 2, . . . , n — 2, differs 
(in ratio) from the size of the optimal code by only a factor 
of a constant 2(q — l)/(qe). 

We first describe a recursive construction of the set §k, q (n) 
in terms of the sets §k,q( n — 0> ^ = 1, . . - , fe. This recursive 
construction immediately establishes the connection to the 
Fibonacci recurrence and helps us determine the size of the 
set in terms of the Fibonacci numbers. 



Theorem 3.2. 

r{(0 fe ,a,s,/3) la./JeZ^eZ-"}, 
I fc + 2<7i<2fc+l, 

k ' q ~ I Uti {M'-\«) : s G $ k , q (n-l),ae Z*}, 

[ 2£; + 2<n. 

Proof: For n = k + 2, . . . , 2k + 1, the coordinates 
Sfc+2, ■ ■ ■ , s n _i necessarily have at most n — k — 2 < k zeroes 
and hence can contain all the words of length n — k — 2. This 
establishes the result for n = k + 2, . . . , 2k + 1. 

Now, let n > 2k + 2. For brevity denote each set on the 
RHS of the equation in Theorem 13.21 by 

T, 4 {( S , 0'-\ a) : s G S M (n - I), a € Z*}. 

Note that the sets T; are mutually disjoint for different I 
since the last I + 1 coordinates have different structure for 
the different sets. To show that §k.q(n) C U/T/, note that any 
element u G §k, q {n) has at most k — 1 zeroes in the last 
n — 1 coordinates and hence the word u must be of the form 
u = (O fe ,u fe+ i,...,u„_/_i,0' _1 ,a) where u„-i-i,a £ Z* 
and I G {1, . . . , k}. Thus, u G T/. 

To show the reverse inclusion, let I G {1, . . . , k} and let 
s e §k, q (n — I)- Note that s ends with a nonzero element. The 
word (s, 0' _1 ,a) where a G Z*, starts with a sequence k , 
ends with a nonzero element and has at most k — 1 consecutive 
zeroes in the last n — 1 coordinates. Hence (s, 0' _1 ,a) G 
§k,q(n) and the set {(s, O' _1 ,o;) : a G Z*} is a subset of 
§k,q(n). Hence, T; C Sf.,q( n ) f° r every I = 1, . . . , k. ■ 

Corollary 3.1. The cardinality of §k ,q{n) for n > 3 is given 
by the equation 

Sk, g (n) - |Sfc,,(n)l = (l - l ? F k, q {n -k-2). 
Proof: For n = fc + 2,...,2fc + l, the corollary can 
be readily verified from the expression in equation (Q~|) and 
Theorem 13.21 We use an induction argument for n > 2k + 2. 
Assume that the corollary is true for n < N where TV > 2A:+2. 
We get, 

k 

Sk,q( N ) = I { ( S ' ' _1 ' : S G §k ^ N ~ 0» a e K)\ 

1 = 1 
k 

= J2(q-l)S k , q (N-l) 
i=i 

k 

= J2(Q-^-^) 2 Fk. q (N-l-k-2) 
i=i 

= (q-l) 2 F kiq (N -k-2). 

We used the induction argument in the second last step. This 
proves the corollary. ■ 
For fixed n and q, the largest size of the set §fc g (n) can 
be obtained by optimizing over the choice of k. Let S(n,q) 
denote this maximum. It is given by the expression 

S(n,q) = max{(q-l) 2 F k . q (n-k-2) : 2 < k < n-2}. (2) 

In particular, the size S(n,q) is upper bounded by the 
maximum cardinality C(n, q) of a cross-bifix-free code, as 
stated in the corollary below. 
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Corollary 3.2. For any n > 3, 

C(n,q)>S(n,q). 

A. Sizes of cross-bifix-free codes for small lengths 

The size of binary cross-bifix-free codes obtained in 
Bilotta et al. (4) is obtained by counting lattice paths, in 
particular, Dyck paths. 

Theorem 3.3 (Bilotta et al. flD). Let B{n) denote the size 
of a binary cross-bifix-free code of length n. For m > 1, let 
C m = -^tt ( 2m ) denote the ?n-th Catalan number. Then 

m+l V m J 



B(n) 

'a 



in i 

rn/2 



^ ^ C% C m — i j 

i=0 

(m+l)/2 

^ ' CiC n 



r 2 

°(m-l)/2> 



n = 2m + 1, m > 1, 



2m + 2, m odd, 



n = 2m + 2, m even. 



For values of n < 14, it is verified by numerical com- 
putations that the sizes obtained by our construction are all 
optimal, except for the value n = 9. In particular we get the 
Table J] of values for 3 < n < 30. The first column gives the 
value of the word length n, the second column shows the sizes 
of the codes obtained in Bilotta et al. O, the third column 
gives the sizes obtained from our construction after optimizing 
over different values of k. Finally, the last column gives the 
values of fc for which Sk, q (n) achieves the maximal size in 
the third column. The numbers in bold denote that the sizes 
are optimal. 

TABLE I 

Table comparing the values from (4) with equation {2) 



n 


(4) 


Eq. (2) 


k 


3 


1 


1 




4 


1 


1 


2 


5 


2 


2 


2 


6 


3 


3 


2 


7 


5 


5 


2 


g 


8 


8 


2 


9 


14 


13 


2 


10 


23 


24 


3 


11 


42 


44 


3 


12 


72 


81 


3 


13 


132 


149 


3 


14 


227 


274 


3 


15 


429 


504 


3 


16 


760 


927 


3 



n 


14] 


Eq. (2) 


k 


17 


1430 


1705 


3 


18 


2529 


3136 


3 


19 


4862 


5768 


3 


20 


8790 


10671 


4 


21 


16796 


20569 


4 


22 


30275 


39648 


4 


23 


58786 


76424 


4 


24 


107786 


147312 


4 


25 


208012 


283953 


4 


26 


380162 


547337 


4 


27 


742900 


1055026 


4 


28 


1376424 


2033628 


4 


29 


2674440 


3919944 


4 


30 


4939443 


7555935 


4 



The optimality of the values for n < 14 is proved computa- 
tionally by setting up a specific program that searches for the 
largest clique in a graph. The graph consists of vertices which 
correspond to the set of all words in Z?J that are bifix-free. An 
edge exists between two vertices, i.e., two words, if they are 
mutually cross-bifix-free. The algorithm MaxCliqueDyn (6) 
is used to determine the maximum size of the clique in the 
graph. This algorithm shows that the values denoted by bold 
in Table U are optimal. 

Note that our construction has larger size than the construc- 
tion in J4] for all values of n, 13 < n < 30. This trend is 



observed asymptotically too, as we describe in the following 
sections. 

4. Near optimality of the size S(n, q) 

In this section we show that the size S(n, q) is close to 
the optimal size C(n,q). The ratio S(n,q)/C(n,q) measures 
how close the construction in Sectionals to the optimal value. 
The following theorem gives an asymptotic lower bound on 
this ratio. 



Theorem 4.1. The following limit holds: 



lim inf 



S(n, q) 



> 2- 



1 



(3) 



n->oc C{n, q) qe 

This lower bound is proved by showing a lower bound on 
S(n, q) and an upper bound on C(n, q). The derivation of the 
lower bound on S(n, q) crucially depends on the properties 
of the (q — 1) -weighted fc-generalized Fibonacci sequence of 
numbers. We digress in the next subsection to first establish 
these needed properties. 

A. Properties of the Fibonacci sequence Fk >q {n) 

Levesque Q showed in a very general context that to every 
weighted fc-generalized Fibonacci sequence of numbers we 
can associate a characteristic polynomial (see Theorem IA.1I 
in the Appendix). For the (q — l)-weighted fc-generalized Fi- 
bonacci sequence, this polynomial specializes to the following 
form 



/Or) = x k -(q-1) 



fc-i 

i=0 



(4) 



Below, we state the properties of this polynomial and of the 
corresponding Fibonacci numbers. The initialization sequence 
that we use is the one described in equation ((TJ. The proofs 
in this section are omitted for clarity of presentation and are 
instead provided in Appendix. 

Proposition 4.1. The polynomial f(x) has distinct roots with 
a unique real root a = a(k, q) outside the unit circle. The root 
a lies in the interval (l,q). 

The value of the root a is in fact close to q. An estimate 
of this root is given by the following lemma. 

Lemma 4.1. There exists a number K q such that the following 
holds. For all fc > K q , there exists a (3 = (3(k, q) in the interval 



-, q) such that 



<1 



g-1 

/3 k 



< a < q. 



(5) 



Finally, the Fibonacci numbers can be expressed in terms 
of this real root a. Let [x] denote the integer closest to the 
real number x. 

Proposition 4.2. Let q > 2. The n-th number in the (q — 1)- 

weighted fc-generalized Fibonacci sequence is given by the 
expression 

(a - l)a n+1 



F k , q (n) 



(q+(k+l)(q-a))(q-l) 
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We note here that Proposition 14. 11 is a generalization to q > 
3 of the result obtained by Miles (9) for q = 2. We adopt 
a technique similar to the one in Miller (TOj. Additionally, 
Proposition 14.21 is a generalization of the result in Dresden Q 
to (q — 1) -weighted fc-generalized Fibonacci numbers, for q > 
3. For q — 2, the expression above reduces to the expression 
for the sequence Fk,2(n>) as obtained in @. 

B. A Lower Bound on S(n, q) 

Using the properties of the Fibonacci numbers from the 
previous subsection, we establish an asymptotic lower bound 
on the size S(n, q). 

Theorem 4.2. The asymptotic size S(n, q) satisfies the limit, 



liminf ^)>9- 1 



q n /n 



qe 



(6) 



Proof: Using Corollary 13.11 and Proposition 14.21 in suc- 
cessive steps we obtain, 

S(n,q) > n(g-l) 2 F M (n-fc-2) 



q n /n 



> 



(q+(k+l)(q-a))(q-l) 



where the term o(l) ->0asn->txi.To derive the asymptotics 
we choose n as an increasing function of k: 

n = n(k) = [ca fe ] , 

where c is a positive constant. Note that a is also a function 
of k. We obtain, 




The last term in the RHS of the equation above can be further 
lower bounded by using Lemma 14.11 We assume that there 
exists a number K q and k > K q , as required by the lemma. 



> c 1 - 



q-1 



cq 



g-1 

qP k 



c/3 



The RHS of the above equation tends to ce 



as k 



since (l + — > e as x — > oo and P(k,q) — > q as fc — > oo. 

e(g-l) 

The term ce <j attains a maximum of -, — when c = 
q/(q—l). The theorem follows by substituting this value into 
the lower bound above. ■ 



C. An upper bound on the maximum size C(n, q) 

An upper bound for the maximum size of a cross-bifix- 
free code is readily obtained from the study of the statistical 
properties of such sets in the data stream. The main object 
of study is the time when the search for any word of the 
cross-bifix-free code in the data stream returns with a positive 
match. Bajic et al. |fl~), OH establish the probability distribution 
function of this time, the expected time duration for a match, 
and the variance of this distribution. In particular, for a cross- 
bifix-free code, the variance a 2 of the time for the first match 
is given by the expression J2] eq. (18)] 

n In 

a =(l-2n)- + — , 

where M is the size of the cross-bifix-free code. Using the 
property that the variance is always nonnegative immediately 
gives us the required upper bound on any cross-bifix-free code. 
In particular, we have the theorem 

Theorem 4.3. Let C(n, q) denote the optimal size of a cross- 
bifix-free code in Z™. Then, 

n 

It is surprising that this bound, which is quite immediate, 
was not noted in the previous works on the size of the cross- 
bifix-free codes. Combining Theorem 14.21 and Theorem 14.31 
we obtain Theorem 14. II 



D. Comparison to earlier results 

To compare with the new construction of binary cross-bifix- 
free codes in Bilotta et al. (4), we look at the asymptotics of 
their construction for large n. From Theorem 13.31 we obtain 
that 

C m , n = 2m + 1 , 



B(n) 



< C m +i, n = 2m + 2, 



where C„ 



—^r ( 2m ) is the m-th Catalan number and m > 
1. Using Stirling's approximation, we get that the number C m 
is approximately, 



C„ 



1 



(2m+l)C„ 



2m + 1 



Thus for n odd, 

B(n) 

2™/n 2 2m+1 2{m+l)^/¥m' 

which goes to zero as 2m + 1 = n —> oo. Similar conditions 
hold for the case n = 2m + 2. Thus the construction in J4] is 
a negligible fraction of 2™/n, in contrast to the nearly optimal 
construction described in the previous section. 

5. Conclusion 

We provided a new construction of cross-bifix-free codes 
that are close to the maximum possible size. The construction 
for the binary codes is showed to be larger than the previously 
constructed codes for all lengths n < 30, barring an exception 
at n = 9. We also provided the first construction of q-ary 
cross-bifix-free codes for q > 2. In the process, we established 
new results on the Fibonacci sequences, generalizing some 
earlier works on these sequences. 
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Appendix 

In this appendix, we provide the proofs of the results on 
the Fibonacci sequences that are stated in Section 14-AI First, 
we recall a very general theorem on weighted fc-generalized 
Fibonacci sequences proved in Levesque 171 . 

Theorem A.l (Levesque 0). Let fc > 2. Let Fk(n) be defined 
by the following recurrence relation, 

fc 

Fk{n) = E a % F k (n -i), for n > fc, 

for a t G Z, i = l,...,fc, and with the initial conditions, 
-Ffc(O), Fk(l), . . . ,F k (k - 1). Additionally, suppose that the 
characteristic polynomial h(x) associated with the sequence 

W»)}£o. 

fe-i 

h(x) = x k -^a k ^x\ 

i=0 

has distinct roots 71, 72, • • • , 7fc. Then, for n > k, the values 
of -Ffc(n) are given by the expression 

fc-i 



3=0 



where, 



v Q = u(0), 

3 

v j = U U) ~ E a Mj ~ *)> for 1 < j < fc - 1, 

i=l 

k k—l+j 

ft= SW forj - L 

For m = q — 1, j = 1, . . . , fc, we obtain the corresponding 
expressions for the (g — l)-weighted fc-generalized Fibonacci 
numbers. In particular, the polynomial h(x) reduces to the 
polynomial f(x) defined in equation ©. 

We proceed with the proofs of the propositions in Sec- 
tion 14-AI In order to prove Proposition 14.11 we first establish 
two lemmas below. Define a polynomial g{x) as 

g(x)±(x-l)f(x)=x k (x-q) + (q-l). 

Lemma A.l. The polynomial f(x) has a real root in the 
interval (1, q). 

Proof: This follows from the fact that /(l) = 1 — k(q — 
1) < and /(g) = g(q)/(q - 1) = 1 > 0. " ■ 

Lemma A.2. Let a = a(k, q) be the real root of f{x) in (1, q). 
Then the polynomial g(x), and consequently the polynomial 
f(x), satisfies the following inequalities. 

(i) g(x) > for x <E (a, 00), 

(ii) g(x) < for x € (l,a). 

Proof: Observe that 

g'(x)=x k - 1 ((k + l)x-kq), 

and so g'(x) < for x <G [1, kq/(k + 1)) and g'(x) > for 
x G (fcV(fc + l)), q}. Since g(l) = g(a) = 0, g(kq/(k + l)) < 
and g(q) > 0, the lemma follows. ■ 



Next, we establish Proposition 14. 1 



Proof of Proposition \4.1\ First, we show that the roots 
of g(x), and hence of f(x), are distinct. Indeed, g'(x) = 
if and only if x = or x = However, g(0) ^ and 

g (*qrr) 7^ 0. Therefore, the roots are distinct. 

Next, let 7 be a root of f(x) with 7 / a. We prove by 
contradiction that |7| < 1. We consider the two cases I7I > \a\ 
and I7I < \a\ separately. Suppose I7I > |a|. Since j k = 

(9- l )YllZll\ we get 

fe-i fe-i 

i7i* = i(9-i)E^i^(«- 1 )EH i ' 

i=0 i=Q 

and so, /(I7I) < 0, contradicting part (i) of Lemma |A. 2 1 

Next, suppose |-y | G [1, \a\]. Since 7 is also a root of g(x), 
ql k = 7 fe +! + (q- 1). Then 

9l7| fc = l7 fe+1 + (9-l)l<l7| fc+1 + (g-l), ( 7 ) 

which implies that .<? ( | T I ) > 0- Then by part (ii) of Lemma 
IA.21 | 7 | G {1, \a\} and equality in (UJ holds. Hence, j k+1 and 
7 fc are real, implying that 7 is real. Since the roots of g(x) 
are distinct, 7 G {-1, -a}. But g(-l) = (-l) k+1 {q + 1) - 
{q - 1) + and fl (-a) = 2a k q+ (1 + - 1) + 0, 

contradicting the fact that 7 is a root of ,g(a;). 

Therefore, a is the only root outside the unit circle. By 
Lemma |A. II a is in the required interval. ■ 



Proof of Proposition \4.2\ We apply Lemma I A. 1 1 with 
dj = q — 1, i = 1, . . . , k and with h(x) = f(x). Observe that 
vq = vi = ■ ■ ■ = Vk-i = 1. We obtain, 

fc-i 

F k,q( n ) = E P "-J 
3=0 

k-1 k k-l+n-j 

/'(-») 

= V 7t " TV 

i=l J ^ j=0 



E ? 



It 



( 7f(7j - 1) \ / 7 8 fc 

^^(g+(/ C + l)(7 i -g))7f- 1 y | U-l 

E 



7.r +1 (7*-i) 



^( ? -l)( ?+ (fe + l)( 7i -g)) 

To obtain the fourth step we used the fact that 7^'s are roots of 
fix). Without loss of generality, let 71 be the a(k, q) defined 
in Proposition 14.21 and so | 7 j| < 1 for i > 2. We note that 
q + (fc + l)(7i — q) = —kq + (fc + l)7j. We get the following 
sequence of inequalities for q > 3. 



7 



E 



7r x (7i-i) 



^ (g-l)(g + (fe + l)( 7j -g)) 



<EK" +1 I 

i=2 
ft 



(7 - 1) 



< 



< 



E 



(g-l)(g + (fc + l)(7i-?)) 
l7i~l| 



^ (g-l)|Asg-(A; + l) 7i | 

2 



i=2 
k 



E — 



.^g-lfc(g-l)-l 



< 



where the second last step is obtained by observing that for 
i > 2 we have the inequalities | 7 — 1| < 2 and |fcg — (k + 
l)7i| > kq — (k + 1). The last step is obtained by applying 
the inequality q > 3. This completes the proof for g > 3. The 
proof for q = 2 is present in Dresden J5). ■ 



Finally, we prove Lemma 14. II Again, for brevity, we denote 
a(k,q) and /3(k,q) by a and f3 respectively. 
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Proof of Lemma \4~7\ Observe that g(q—l/q k 1 ) < if 
and only if 

(1 - l/q k ) k > (1 - I/?). 

Since (1 — \/q k ) k — > 1 as k — > oo, there exists a constant K q 
such that g(q - l/q^ 1 ) < for all k > K q . 

Hence, for all k > K q , there exists j3 in the interval (q — 
l/q k ~ 1 ,q) such that g(j3) < 0. We claim that /9 satisfies © by 
showing that g(/3) < implies that g(q — 1//3 ) < 0. Indeed, 
since 



g(J3) = p k (/J-9) + (q - 1) < 0, 



we get 



f3<q- 



g 9 



;3 k 



< 0. 



Since g{a) = 0, we get q — ^p- < a < q. 



+ (?-!) 
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